High-Dimensional Statistical Process Control via Manifold Fitting and Learning

Tas, Burak I., del Castillo, Enrique

arXiv.org Machine Learning

We address the Statistical Process Control (SPC) of high-dimensional, dynamic industrial processes from two complementary perspectives: manifold fitting and manifold learning, both of which assume the data lies on an underlying nonlinear, lower-dimensional space. We propose two distinct monitoring frameworks for online, or 'phase II', SPC. The first method leverages state-of-the-art techniques in manifold fitting to accurately approximate the manifold where the data resides within the ambient high-dimensional space. It then monitors deviations from this manifold using a novel scalar distribution-free control chart. In contrast, the second method adopts a more traditional approach, akin to those used in linear dimensionality-reduction SPC techniques, by first embedding the data into a lower-dimensional space before monitoring the embedded observations. We prove that both methods provide a controllable Type I error probability, after which they are contrasted for their corresponding fault-detection ability. Extensive numerical experiments on a synthetic process and on a replicated Tennessee Eastman Process show that the conceptually simpler manifold-fitting approach achieves performance competitive with, and sometimes superior to, the more classical lower-dimensional manifold-monitoring methods. In addition, we demonstrate the practical applicability of the proposed manifold-fitting approach by successfully detecting surface anomalies in a real image dataset of electrical commutators.


Adaptive Linear Embedding for Nonstationary High-Dimensional Optimization

Wen, Yuejiang, Franzon, Paul D.

arXiv.org Machine Learning

Bayesian Optimization (BO) in high-dimensional spaces remains fundamentally limited by the curse of dimensionality and the rigidity of global low-dimensional assumptions. While Random EMbedding Bayesian Optimization (REMBO) mitigates this via linear projections into low-dimensional subspaces, it typically assumes a single global embedding and a stationary objective. In this work, we introduce Self-Adaptive embedding REMBO (SA-REMBO), a novel framework that generalizes REMBO to support multiple random Gaussian embeddings, each capturing a different local subspace structure of the high-dimensional objective. An index variable governs the embedding choice and is jointly modeled with the latent optimization variable via a product kernel in a Gaussian Process surrogate. This enables the optimizer to adaptively select embeddings conditioned on location, effectively capturing locally varying effective dimensionality, nonstationarity, and heteroscedasticity in the objective landscape. We theoretically analyze the expressiveness and stability of the index-conditioned product kernel and empirically demonstrate the advantage of our method across synthetic and real-world high-dimensional benchmarks, where traditional REMBO and other low-rank BO methods fail. Our results establish SA-REMBO as a powerful and flexible extension for scalable BO in complex, structured design spaces.


Sustainable Visions: Unsupervised Machine Learning Insights on Global Development Goals

García-Rodríguez, Alberto, Núñez, Matias, Pérez, Miguel Robles, Govezensky, Tzipe, Barrio, Rafael A., Gershenson, Carlos, Kaski, Kimmo K., Tagüeña, Julia

arXiv.org Artificial Intelligence

The United Nations 2030 Agenda for Sustainable Development outlines 17 goals to address global challenges. However, progress has been slower than expected and, consequently, there is a need to investigate the reasons behind this. In this study, we used a novel data-driven methodology to analyze data from 107 countries (2000-2022) using unsupervised machine learning techniques. Our analysis reveals strong positive and negative correlations between certain SDGs. The findings show that progress toward the SDGs is heavily influenced by geographical, cultural and socioeconomic factors, with no country on track to achieve all goals by 2030. This highlights the need for a region-specific, systemic approach to sustainable development that acknowledges the complex interdependencies of the goals and the diverse capacities of nations. Our approach provides a robust framework for developing efficient and data-informed strategies to promote cooperative and targeted initiatives for sustainable progress.


Uncovering Hidden Meaning: A Beginner's Guide to Latent Semantic Analysis

#artificialintelligence

If you have ever worked with text data, you have likely encountered the challenge of dealing with high-dimensional and sparse data. One popular solution to this problem is latent semantic analysis (LSA), also known as latent semantic indexing (LSI). LSA is a technique for extracting latent (hidden) semantics from a collection of documents or text data. It does this by mapping the documents into a lower-dimensional space, where the relationships between the documents and the underlying concepts they represent can be more easily understood. One of the key benefits of LSA is that it can handle large amounts of data efficiently and is robust to noise and sparse data.
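The core of LSA described above is a truncated SVD of a document-term matrix. A minimal numpy sketch on a made-up toy corpus (the count matrix and topic labels are hypothetical, chosen only to illustrate the mechanics):

```python
import numpy as np

# Tiny document-term count matrix: rows = documents, columns = terms.
# (Hypothetical toy corpus; any bag-of-words matrix works the same way.)
X = np.array([
    [2, 1, 0, 0],   # doc on topic A
    [1, 2, 0, 0],   # doc on topic A
    [0, 0, 1, 3],   # doc on topic B
    [0, 0, 3, 1],   # doc on topic B
], dtype=float)

# LSA = truncated SVD of the document-term matrix.
U, s, Vt = np.linalg.svd(X, full_matrices=False)
k = 2                               # number of latent "topics" to keep
doc_embed = U[:, :k] * s[:k]        # documents in the k-dim latent space

# Documents on the same topic end up close together in latent space.
def cos(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

print(cos(doc_embed[0], doc_embed[1]))  # same-topic pair: near 1
print(cos(doc_embed[0], doc_embed[2]))  # cross-topic pair: near 0
```

Real LSA pipelines typically apply TF-IDF weighting before the SVD, but the dimensionality-reduction step itself is exactly this truncation.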


Uncovering the Essence of Principal Component Analysis: A Comprehensive Guide

#artificialintelligence

Principal component analysis (PCA) is a popular statistical technique for reducing the dimensionality of a dataset while preserving important patterns and relationships in the data. At its core, PCA is a linear transformation method that projects the data onto a lower-dimensional space, revealing the underlying structure of the data. But what exactly is PCA and how does it work? In this article, we'll delve into the fundamentals of PCA and explore its applications in a variety of fields, including machine learning, data visualization, and image processing. We'll also discuss some of the key challenges and limitations of using PCA, and provide practical tips for implementing it in your own analyses.
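The linear projection described above can be sketched in a few lines of numpy: center the data and take the top right-singular vectors of the centered matrix. The dataset here is synthetic (three features driven by one shared factor), chosen only to show the projection at work:

```python
import numpy as np

rng = np.random.default_rng(0)
# 200 samples of 3 highly correlated features (toy data; any numeric matrix works).
z = rng.normal(size=(200, 1))
X = np.hstack([z + 0.1 * rng.normal(size=(200, 1)) for _ in range(3)])

# PCA: center the data, then take the top right-singular vectors of X.
Xc = X - X.mean(axis=0)
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)

k = 1
scores = Xc @ Vt[:k].T            # projection onto the first principal axis
explained = s**2 / np.sum(s**2)   # fraction of variance per component

print(scores.shape)   # (200, 1)
print(explained[0])   # close to 1: one direction carries ~all the variance
```

Because the three features are nearly copies of one underlying factor, a single principal component preserves almost all of the structure.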


A Beginner's Guide to Principal Component Analysis

#artificialintelligence

In principal component analysis, a principal component is a new feature that is constructed from a linear combination of the original features in a dataset. The principal components are ordered such that the first principal component has the highest possible variance (i.e., the greatest amount of spread or dispersion in the data), and each subsequent component in turn has the highest variance possible under the constraint that it is orthogonal (i.e., uncorrelated) to the previous components. The idea behind PCA is to reduce the dimensionality of a dataset by projecting the data onto a lower-dimensional space, while still preserving as much of the variance in the data as possible. This is done by selecting a smaller number of principal components that capture the most important information in the data, and discarding the remaining, less important components. In this way, PCA can be used to identify patterns and relationships in high-dimensional data, and to visualize data in a lower-dimensional space for easier interpretation.
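The two properties stated above, decreasing variance and mutual orthogonality of the components, can be checked directly with numpy on synthetic data (the feature scales below are arbitrary, picked so the components have clearly different variances):

```python
import numpy as np

rng = np.random.default_rng(1)
# Toy data: 500 samples, 4 independent features with different spreads.
X = rng.normal(size=(500, 4)) * np.array([5.0, 2.0, 1.0, 0.5])

Xc = X - X.mean(axis=0)
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
var = s**2 / (len(X) - 1)       # variance captured by each component

# Components are ordered by decreasing variance...
print(np.all(np.diff(var) <= 0))          # True
# ...and the component directions are mutually orthogonal.
print(np.allclose(Vt @ Vt.T, np.eye(4)))  # True

# Keeping the top 2 of 4 components still retains most of the variance.
print(var[:2].sum() / var.sum())
```

Discarding the trailing components is exactly the "keep the important, drop the unimportant" step the paragraph describes.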


ML

#artificialintelligence

T-distributed Stochastic Neighbor Embedding (t-SNE) is a nonlinear dimensionality reduction technique well-suited for embedding high-dimensional data for visualization in a low-dimensional space of two or three dimensions. Dimensionality reduction is the technique of representing n-dimensional data (multidimensional data with many features) in 2 or 3 dimensions. As an example, consider a classification problem, such as predicting whether a student will play football or not, that relies on both temperature and humidity: since the two features are highly correlated, they can be collapsed into a single underlying feature. Hence, we can reduce the number of features in such problems. A 3-D classification problem can be hard to visualize, whereas a 2-D one can be mapped to a simple 2-dimensional space and a 1-D problem to a simple line.
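The temperature/humidity example above can be sketched numerically. Since collapsing two linearly correlated features is a linear operation, the sketch uses a PCA-style projection rather than t-SNE itself (whose iterative optimization is too involved for a few lines); the readings below are simulated, not real:

```python
import numpy as np

rng = np.random.default_rng(2)
# Hypothetical sensor readings: humidity closely tracks temperature.
temp = rng.normal(25, 5, size=300)
humidity = 0.8 * temp + rng.normal(0, 1, size=300)
X = np.column_stack([temp, humidity])

# Collapse the two correlated features into one underlying feature:
# project onto the first principal direction of the centered data.
Xc = X - X.mean(axis=0)
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
combined = Xc @ Vt[0]            # single derived feature per sample

share = s[0]**2 / np.sum(s**2)   # variance kept by the single feature
print(combined.shape)   # (300,)
print(share)            # close to 1: little information is lost
```

The single derived feature could then replace the original pair as an input to the football-or-not classifier.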


Understanding dimensionality reduction in machine learning models

#artificialintelligence

Machine learning algorithms have gained fame for being able to ferret out relevant information from datasets with many features, such as tables with dozens of columns and images with millions of pixels. Thanks to advances in cloud computing, you can often run very large machine learning models without noticing how much computational power works behind the scenes. But every new feature that you add to your problem adds to its complexity, making it harder to solve with machine learning algorithms. Data scientists use dimensionality reduction, a set of techniques that remove redundant and irrelevant features from their machine learning models. Dimensionality reduction slashes the costs of machine learning and sometimes makes it possible to solve complicated problems with simpler models. Machine learning models map features to outcomes.
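The simplest form of the feature-removal idea described above is to drop features that barely vary, since a near-constant column cannot help distinguish outcomes. A small numpy sketch on a made-up dataset (the feature counts and threshold are arbitrary illustrations):

```python
import numpy as np

rng = np.random.default_rng(3)
n = 400
# Toy dataset: 2 informative features plus 8 near-constant, irrelevant ones.
informative = rng.normal(size=(n, 2))
irrelevant = 5.0 + 0.001 * rng.normal(size=(n, 8))
X = np.hstack([informative, irrelevant])

# Drop features whose variance is tiny: they carry almost no information
# about any outcome, so removing them simplifies the model at no cost.
variances = X.var(axis=0)
keep = variances > 0.01
X_reduced = X[:, keep]

print(X.shape, "->", X_reduced.shape)   # (400, 10) -> (400, 2)
```

Projection methods such as PCA go further by combining features rather than just deleting them, but the goal is the same: fewer inputs, simpler and cheaper models.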


Machine learning: What is dimensionality reduction?

#artificialintelligence

Machine learning algorithms have gained fame for being able to ferret out relevant information from datasets with many features, such as tables with dozens of columns and images with millions of pixels. Thanks to advances in cloud computing, you can often run very large machine learning models without noticing how much computational power works behind the scenes. But every new feature that you add to your problem adds to its complexity, making it harder to solve with machine learning algorithms. Data scientists use dimensionality reduction, a set of techniques that remove redundant and irrelevant features from their machine learning models. Dimensionality reduction slashes the costs of machine learning and sometimes makes it possible to solve complicated problems with simpler models.


Framework for Data Preparation Techniques in Machine Learning

#artificialintelligence

There are a vast number of different types of data preparation techniques that could be used on a predictive modeling project. In some cases, the distribution of the data or the requirements of a machine learning model may suggest the data preparation needed, although this is rarely the case given the complexity and high dimensionality of the data, the ever-increasing parade of new machine learning algorithms, and the inevitably limited knowledge of any one practitioner. Instead, data preparation can be treated as another hyperparameter to tune as part of the modeling pipeline. This raises the question of how to know which data preparation methods to consider in the search, which can feel overwhelming to experts and beginners alike. The solution is to think about the vast field of data preparation in a structured way and systematically evaluate data preparation techniques based on their effect on the raw data.
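The "data preparation as a hyperparameter" idea can be sketched as a tiny search loop. Everything below is a hypothetical stand-in: numpy only, a made-up two-feature dataset with wildly different scales, and a simple nearest-centroid model in place of a real pipeline and cross-validation:

```python
import numpy as np

rng = np.random.default_rng(4)
# Toy two-class data where feature scales differ wildly (hypothetical).
n = 200
y = rng.integers(0, 2, size=n)
X = np.column_stack([
    y + 0.5 * rng.normal(size=n),    # informative, small scale
    1000 * rng.normal(size=n),       # pure noise, huge scale
])

# Candidate data preparation techniques to search over.
def identity(A):
    return A

def standardize(A):
    return (A - A.mean(0)) / A.std(0)

def minmax(A):
    return (A - A.min(0)) / (A.max(0) - A.min(0))

def nearest_centroid_acc(A, y):
    # Classify each point by the closer class centroid; return accuracy.
    c0, c1 = A[y == 0].mean(0), A[y == 1].mean(0)
    pred = (np.linalg.norm(A - c1, axis=1)
            < np.linalg.norm(A - c0, axis=1)).astype(int)
    return (pred == y).mean()

# Treat the preparation step itself as a hyperparameter: evaluate the
# model under each candidate transform and keep the best-scoring one.
candidates = {"none": identity, "standardize": standardize, "minmax": minmax}
scores = {name: nearest_centroid_acc(f(X), y) for name, f in candidates.items()}
best = max(scores, key=scores.get)
print(scores)
print("best preparation:", best)
```

Here the raw-scale variant fails because the noisy large-scale feature dominates the distance computation, so the search selects a rescaling transform; in practice the same loop would wrap a real pipeline and cross-validated scoring.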